A Simulation Model for Fault Tolerance Evaluation

نویسندگان

  • Adrian BOTEANU
  • Ciprian DOBRE
  • Adrian Boteanu
  • Ciprian Dobre
چکیده

Această lucrare prezintă un model de simulare pentru evaluarea soluţiilor de asigurare a toleranţei la defecte în sistemele distribuite de mari dimensiuni. Modelul extinde simulatorul MONARC prin adăugarea de noi funcţionalităţi pentru evaluarea toleranţei la defecte. Modelul descrie defecte ce pot apărea în astfel de sisteme şi include mecanisme pentru detecţia si corecţia acestora. În cadrul lucrării este prezentată şi o implementare pilot a modelului, împreună cu rezultatele testelor de evaluare. Au fost implementate atât defecte permanente cât şi tranziente ce pot apărea în cazul unităţilor de procesare, componentelor de reţea sau a bazelor de date. Modelul poate fi uşor extins, permiţând adăugarea de noi clase de defecte i tehnologii aferente, în funcţie de experimentul vizat. Modelul poate fi folosit pentru evaluarea performanţelor unor soluţii de toleranţă la defecte pentru sisteme distribuite, pretându-se identificării rapide a punctelor sau ariilor vulnerabile din sistemul simulat.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reliability and Performance Evaluation of Fault-aware Routing Methods for Network-on-Chip Architectures (RESEARCH NOTE)

Nowadays, faults and failures are increasing especially in complex systems such as Network-on-Chip (NoC) based Systems-on-a-Chip due to the increasing susceptibility and decreasing feature sizes. On the other hand, fault-tolerant routing algorithms have an evident effect on tolerating permanent faults and improving the reliability of a Network-on-Chip based system. This paper presents reliabili...

متن کامل

Improving the palbimm scheduling algorithm for fault tolerance in cloud computing

Cloud computing is the latest technology that involves distributed computation over the Internet. It meets the needs of users through sharing resources and using virtual technology. The workflow user applications refer to a set of tasks to be processed within the cloud environment. Scheduling algorithms have a lot to do with the efficiency of cloud computing environments through selection of su...

متن کامل

Evaluation of composite object replication schemes for dependable server applications

Object oriented dependable server applications often rely on fault tolerance schemes, which are comprised of different replication policies for the constituent objects (composite replication schemes). This paper introduces a simulation-based evaluation approach for quantifying the tradeoffs between fault-tolerance overhead and fault tolerance effectiveness in composite replication schemes. Comp...

متن کامل

An Evolutionary Method for Improving the Reliability of Safetycritical Robots against Soft Errors

Nowadays, Robots account for most part of our lives in such a way that it is impossible for usto do many of affairs without them. Increasingly, the application of robots is developing fastand their functions become more sensitive and complex. One of the important requirements ofRobot use is a reliable software operation. For enhancement of reliability, it is a necessity todesign the fault toler...

متن کامل

Novel Defect Terminolgy Beside Evaluation And Design Fault Tolerant Logic Gates In Quantum-Dot Cellular Automata

Quantum dot Cellular Automata (QCA) is one of the important nano-level technologies for implementation of both combinational and sequential systems. QCA have the potential to achieve low power dissipation and operate high speed at THZ frequencies. However large probability of occurrence fabrication defects in QCA, is a fundamental challenge to use this emerging technology. Because of these vari...

متن کامل

Fault Tolerance Simulation and Evaluation Tool for Artificial Neural Networks

This paper presents the FTSET tool for fault tolerance evaluation and improvement of Artificial Neural Networks. Fault tolerance is a characteristic of parallel distributed systems such as neural networks. Although there is a built-in fault tolerance in neural networks, it is possible to improve this characteristic, but changing the structure of an artificial neural network to improve its fault...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011